Bo Du

MiCU: End-to-End Smart Home Command Understanding with Large Language Model

May 31, 2026
Via arXiv

ConRAG: Consensus-Driven Multi-View Retrieval for Multi-Hop Question Answering

May 27, 2026
Via arXiv

Leveraging Text-to-Image Diffusion Models for Unsupervised Visual Object Tracking

May 26, 2026
Via arXiv

Better, Faster: Harnessing Self-Improvement in Large Reasoning Models

May 24, 2026
Via arXiv

UHR-Micro: Diagnosing and Mitigating the Resolution Illusion in Earth Observation VLMs

May 12, 2026
Via arXiv

Learn to Think: Improving Multimodal Reasoning through Vision-Aware Self-Improvement Training

May 12, 2026
Via arXiv

LeapTS: Rethinking Time Series Forecasting as Adaptive Multi-Horizon Scheduling

May 11, 2026
Via arXiv

Belief Memory: Agent Memory Under Partial Observability

May 07, 2026
Via arXiv

VTAgent: Agentic Keyframe Anchoring for Evidence-Aware Video TextVQA

May 06, 2026
Via arXiv

SAMe: A Semantic Anatomy Mapping Engine for Robotic Ultrasound

Apr 28, 2026
Via arXiv